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Abstract 

Providing security guarantees for wireless communication is critically important for today's applications. While 
previous work in this area has concentrated on radio frequency (RF) channels, providing security guarantees for RF 
channels is inherently difficult because they are prone to rapid variations due small scale fading. Wireless optical 
communication, on the other hand, is inherently more secure than RF communication due to the intrinsic aspects 
of the signal propagation in the optical and near-optical frequency range. In this paper, secure communication over 
wireless optical links is examined by studying the secrecy capacity of a direct detection system. For the degraded 
Poisson wiretap channel, a closed-form expression of the secrecy capacity is given. A complete characterization of 
the general rate-equivocation region is also presented. For achievability, an optimal code is explicitly constructed by 
using the structured code designed by Wyner for the Poisson channel. The converse is proved in two different ways: 
the first method relies only on simple properties of the conditional expectation and basic information theoretical 
inequalities, whereas the second method hinges on the recent link established between minimum mean square 
estimation and mutual information in Poisson channels. 
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I. Introduction 

PROTECTING information flow from unauthorized access is of vital importance for today's applications. 
The concept of information secrecy however is not new and dates back to the pioneering work of Shannon 
ifTl . Shannon considered a strong notion of secrecy requiring essentially that the eavesdropper's received signal be 
independent of the communicated message. Under this constraint Shannon showed that the transmitter must share 
a private key with the legitimate receiver whose entropy is at least as large as the message's entropy. This negative 
result lead to the development of modern cryptographic systems whose objective is not to provide strong secrecy 
but rather to make security breaches computationally prohibitive. These encryption algorithms are employed today 
in most systems where secrecy is required. This wide deployment, however, should not obscure the fact that these 
methods do not offer strong security guarantees. Information-theoretical security can provide such guarantees and 
as such has attracted considerable attention lately. 

The pessimistic result of Shannon is due to the implicit assumption that the eavesdropper observes the same 
signal as the legitimate receiver. In his celebrated paper [2], Wyner challenged this assumption by introducing 
the wiretap channel, a channel in which an eavesdropper observes a degraded version of the signal received by 
the legitimate receiver. Wyner also considered a different notion of secrecy which requires that asymptotically the 
equivocation of the eavesdropper about the transmitted message should converge to the message's entropy. With this 
new framework, and for discrete memoryless channels (DMCs), Wyner gave a complete description of the tradeoff 
between the information rate at the legitimate receiver and the information leaked to the eavesdropper. Wyner 
showed that for a DMC there exists an intrinsic quantity called the secrecy capacity which gives the maximum 
information rate that can be reliably transmitted to the legitimate receiver with zero leakage of information to the 
eavesdropper. Since then, the wiretap channel has become one of the core topics in information theoretic security 
and several results are available today. Csiszar and Komer [Si extended Wyner's study to nondegraded DMCs. The 
degraded Gaussian wiretap channel was studied in H. Multiple input multiple output (MIMO) Gaussian wiretap 
channels have been the subject of an extensive interest lately Il5l-||7l. Wiretap channels in the presence of fading 
have been also investigated in ||8l-|l9l (and the references therein). 

The results of ||3l show that a non-zero secrecy capacity requires that the legitimate receiver's channel be less noisy 
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than the eavesdropper's. For RF channels, however, this is difficult to guarantee in practice due to the possibility 
of multipath fading. Indeed, even if the legitimate receiver is closer to the transmitter than the eavesdropper is, the 
legitimate receiver may still have a weaker channel due to fading. Moreover, since the fading state is a sensitive 
function of the position of the receivers, it is difficult to predict the degree of fading experienced by the legitimate 
receiver and the eavesdropper given imperfect information about their locations. 

One possible solution to this problem is to use optical or near-optical frequencies instead of RF. For optical 
wireless systems, the detector is usually multiple orders of magnitude larger than the wavelength of the transmitted 
beam, which provides natural immunity against multipath fading via spatial diversity |10|. This immunity makes 
predictions about the quality of the legitimate receiver's and the eavesdropper's signal based on their position more 
accurate. In fact, with the multipath problem gone, the only major channel impairment remaining is the pathloss 
which can be safely assumed to be higher for the eavesdropper if the legitimate receiver can guarantee that he is 
closer to the transmitter. 

Another advantage of optical communications over RF is that the transmitted signal is highly directional, making 
interception by a malevolent third party more difficult. This should be contrasted with the relatively-omnidirectional 
nature of RF transmissions, for which the signal is broadcasted over a wide angle. Yet another advantage of wireless 
optical communication is the spatial confinement of the transmitted optical signal. Indeed, at optical wavelengths, 
the transmitted signal is absorbed by the atmosphere and beyond a certain range it becomes undetectable. This is 
desirable from a security standpoint as an eavesdropper located beyond this range is literally kept in the dark. 

All of these features give wireless optical communications a clear advantage for security. This technology is 
already being deployed in the form of infrared communications |[T]l - |[T2l . and ultraviolet (UV) systems are currently 
under development |[T3l - |[T4l . However, despite the numerous benefits that this technology offers, coding is still 
needed for those scenarios in which the channel itself does not provide absolute secrecy, such as when the beam 
of light is reflected or scattered by solid objects, dust or water droplets |[T5l . Although in this case the signal has 
been degraded, an eavesdropper could still gain valuable information. 

We examine the fundamental limits of coding for secure communication over optical channels by studying the 
secrecy capacity of the Poisson channel, a common model for direct detection optical communications systems. In 



such systems the transmitter sends information by modulating the intensity of an optical signal while the receiver 
observes the arrival moments of individual photons. The capacity of this channel has been determined under 
peak power constraint on the transmitted optical power by Kabanov |fT6l and under both average and peak power 
constraint by Davis ifTTl . Wyner ifTSl derived the reliability function of this channel for all rates below capacity 
and constructed exponentially optimal codes. Multiple-access Poisson channels were studied in ||T9l - |[20l whereas 
broadcast Poisson channels were considered in |[2ll and |[22l . The capacity of the Poisson channel has been also 
investigated in the presence of fading |[23i . 

We study in this paper the degraded Poisson wiretap channel. The legitimate receiver observes a doubly stochastic 
Poisson process with instantaneous rate AyXt + Aj^ where {Xt,0 < t < T} is the signal transmitted. The 
eavesdropper's observation is also a doubly stochastic Poisson process with instantaneous rate AzXt + Xz- For 
degradedness we assume thajj Ay > A^ and < ^A^. In Theorem 1 we provide a closed form expression of the 
secrecy capacity as a function of the parameters (A^, A„), u G {y, z\. This result is further extended by Theorem 
5 which gives a full characterization of the rate equivocation region. 

Our achievability proof uses stochastic encoding as well as the structured codes constructed by Wyner for the 
Poisson channel ifTSl . As for the converse, we will see that the infinite bandwidth nature of the Poisson channel 
makes it possible to prove the converse using only simple properties of the conditional expectation combined 
with basic information theoretical inequalities. This is to be contrasted with the converse of the (finite bandwidth) 
Gaussian channel which is proved using the entropy power inequality (EPI). As an illustration for the basic ideas that 
underpin the converse for the Poisson channel, we will start by considering here the more familiar infinite bandwidth 
Gaussian channel and we will see also that for this channel the proof of the converse simplifies considerably. 

For this purpose, consider the continuous time Gaussian wiretap channel with bandwidth B (later we will let B 

tend to infinity) and with a power constraint P. This continuous time channel is equivalent to 2B uses per second 

of the discrete time Gaussian channel depicted in Fig. 1. The input signal is power constrained, i.e., E[X^] < P, 

the legitimate receiver observes Y = X ^ W\ and the eavesdropper receives Z = Y ^ W2 = X + Wi + W2, where 

Wi ~ Af{0, N,B) and Wi AL W2. 

'These conditions were shown to be sufficient for degradedness in 1211 . The argument is reproduced in Lemma 1 below. 



5 



Wi 



W2 



X 



Y 




Z 



Fig. 1. The discrete time Gaussian wiretap channel 



Define iV by ^ = ^ 



and observe that 




(1) 



where W = Wi- ^T4^2- It is easy to see that W 7\A(0, NB) and £[^^(^"1 + W's)] = 0, it follows therefore that 
_LL + W2 (since they are jointly Gaussian). For the discrete time Gaussian wiretap channel, it is known that 
the secrecy capacity is given by maXp^(I(X;y) — I{X;Z)). For the continuous time channel counterpart with 
bandwidth B, the secrecy capacity becomes Cf = 2B maxp^ iH^'^ ^) ~ H-^'j ^))- 

In m, using the celebrated EPI, a closed form expression for the secrecy capacity of the discrete time Gaussian 
wiretap channel was obtained. In just a few steps, we will see that the secrecy capacity of the infinite bandwidth 
{B 00) Gaussian wiretap channel can be found much more simply. Starting with ([U we obtain the following 
sequence of inequalities 



Inequality (a) follows from the data processing inequality, equalities in {h) and (e) are standard information 
theory identities, (c) follows from the independence bound on entropy and finally {d) holds because X + W and 
X+W1+W2 are conditionally independent given X (i.e., {X+W) _LL {X+Wi+W2)\X). Basically, the key identity 
needed to go from (a) to (e) is the following: if Yi _LL Y2\X, then we have I{X]Yi,Y2) < I{X;Yi) + I{X]Y2). A 



IiX;Y)=IiX;X + j;^^Z + W) 
(«) 

< I{X;X + W,Z) 



h{X + W,Z)- h{X + W, Z\X) 



< h{X + W) + h{Z) - h{X + W, Z\X) 



(d) 



h{X + W) + h{Z) - h{X + W\X) - h{Z\X) 



I{X-X + W)+I{X-Z), 



proof of this simple inequality in a more general setting will be given later and will be used in part of the converse 
for the Poisson channel. Going back to the Gaussian problem, we see that 

Cf = 2Bma.x{I{X;Y) - I{X;Z)) < 2BmaxI{X;X + W) = Bln{l + -^). (2) 

Px Px BN 

For a fixed bandwidth B, this last inequality is not tight. Now letting i? — )• oo we obtain 

CT = lim C7f < i = ^ - — (3) 

However, since maXp^(/(X; Y) — I{X; Z)) > maxp^ H-^'-: ^) ~ maxp^ I{X; Z), we also have that 

/ P P \ P P 
C^> lim rBln(l + )-B\n{l + — -) = . (4) 

It follows that C~ = ^ 



This remarkably simple approach will be particularly useful for the Poisson channel. More specifically, when 



= the eavesdropper's signal Z is a thinned version of the legitimate receiver's signal Y, i.e.,n Y = Z + Z 



where Z _LL Z\X, the approach above gives that I{X; Y) — I{X; Z) < I{X; Z). Since Z is itself a doubly stochastic 
Poisson process, the mutual information I{X;Z) can be maximized using the martingale techniques of Kabanov 
|[T6l and an (achievable) upperbound can be obtained on I{X; Y) — I{X; Z). When ^ < a different bounding 
technique using only simple properties of the conditional expectation will be devised. 

Although no "sophisticated" tools are required to prove the converse, we show in the appendix that using some 
new results in information theory an alternative proof can be provided. This different proof hinges on the link that 
has been established between the mutual information (MI) and the minimum mean square estimation (MMSE) in 
Poisson channels Il24l . It is worth noting at this point that the link between the MI and the MMSE in the Gaussian 
setting II25I has been also used recently for different Gaussian wiretap channels |[26l . ETl . 

One of the distinctive aspects in this paper is that we do not resort to the A-discretization method introduced 

by Wyner |18|. This method was used to approximate the Poisson channel by a binary DMC thereby allowing 

the transposition of the widely known results for DMCs to the Poisson channel. This technique leads to extensive 

computations, especially when we are interested in the secrecy capacity as there are now two conflicting objectives 
^The time dependence has been dropped to ease the notations. Refer to the converse part of the paper for a mathematically precise 
statement. 
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involved, the maximization of the information rate at the legitimate receiver and the minimization of the information 
leakage at the eavesdropper. We circumvent the use of this method by using the techniques described above. 

The rest of this paper is organized as follows. The next section describes the setup of the problem and presents 
the main result of this paper as well as some interpretations of the obtained result. The proof of the achievability 
of the secrecy capacity is given in Section III and the proof of the converse is presented in Section IV. In Section 
V we extend the main result of the paper by giving a complete characterization of the rate-equivocation region. 
Finally, in section VI, some possible future directions are discussed. 

II. Problem and Result Statement 

The input process to the Poisson channel is a waveform denoted hy = {Xt,0 < t < T} satisfying Xt > 
for all t. We further assume that the input process is peak power limited, i.e., Xt < 1 for all t. The received signal 
at the legitimate receiver is a doubly stochastic Poisson process with instantaneous rate AyXt + Ay, i.e., given 
Xq the stochastic process Yff has independent increments with Yq = and for < s < t < T we have 



The parameter Ay > accounts for possible signal attenuation at the receiver. The parameter Ay > is the dark 
current intensity which results from background noise and bears no information on the input process Xq. Similarly 
the output process of the eavesdropper Zq is a doubly stochastic Poisson process with instantaneous rate AzXt + Xz- 

In this paper, the space of doubly stochastic Poisson processes on the interval [0, T] will be denoted by V{T). 
Following the notation used in ll24l the output process of the Poisson channel in the interval [0, T] with instantaneous 
rate aXt + A will be denoted by P(f (aX^f + A). We use {Xt)s to designate E[Xt\VQ{X^)], as such {Xt)t refers 
to the causal conditional mean estimate and {Xtjr to the noncausal one. 

All stochastic processes considered in this paper are defined on a common measurable space We use J"| 

to denote the internal history generated by the process ^g- 



Pr(y, -Ys = k\Xl) = T^(s,t)e-^(^-*), k G N, 



where 




In this paper we are interested in the degraded Poisson wiretap channel. Lapidoth et al. lIlTI gave conditions on 
the parameters {Au,Xu), u € {y,z} for stochastic degradedness. These conditions are presented in the following 
lemma. In order to prepare for the results to come we will also briefly go over the proof of this lemma. 

Lemma 1 (Lapidoth, Telatar and Urbanke Il2l]| '). The eavesdropper's channel is stochastically degraded with re- 
spect to the legitimate receiver's channel, if 



Ay > Az, 



(5) 



and 



Proof: Let (cf. Fig. 2) be the process defined as follows 



(6) 



Yt = Yt + Ht, te[0,T], 



(7) 



where Ht is a homogeneous Poisson process with rate X = — Xy (note that A > by ^) independent of 

{XQ,Yff). It follows that Iq is a doubly stochastic Poisson process with instantaneous rate AyXt + Aj^ + A = 
AyXt + x'-^z- The process Zq is then obtained from Yq by thinning with erasure probability \ — (note that 
because of dSjl this quantity is > 0). ■ 
In the rest of this paper we will assume that at least one of the inequalities ([5]) or Q is strict. Note that this 
assumption can be made without losing generality for if there was an equality in ((S) and then the legitimate 
receiver's channel and the eavesdropper's channel will be identical and the secrecy capacity will be zero. 
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Fig. 2. The degraded Poisson wiretap channel 



We move now to the description of the information transmission aspect of the problem. The transmitter wishes 
to communicate a message U uniformly distributed onU = {1, ...,M}. An {M,T) code {Et,Dt) for the Poisson 
wiretap channel is a stochastic encoder Et that maps a message U to a waveform X^ which satisfies the peak 



power constraint and a decoder Dt : V{T) hi. The transmission rate of this code is 

H{U) 1 
R = — ^ = - nM. 
T T 

The average probability of error at the legitimate receiver is 

M 

M 



1 



m=l 



The level of secrecy in this paper is measured by ^I{U ; Zq). This normalized mutual information quantifies the 
amount of information about the message U leaked to the eavesdropper. As such our goal is to make this quantity 
as small as possible. 

Definition A secrecy rate Rg is said to be achievable for the Poisson wiretap channel if for all e > and all 
sufficiently large T, there exists an (M, T) code such that 

InM 



^ >Rs-e 
Pe<e 

^I(U;Z^)<e (9) 

The supremum of achievable secrecy rates will be called the secrecy capacity. The main result of this paper is the 
following. 

Theorem 1. The secrecy capacity of the degraded Poisson wiretap channel is given bjc 

where a* is the unique solution in [0, 1] to the following equation 

{A,a* + X,)^' {A, + X,y'+^' X^"' 

This result assumes that {Az,Xz) is known to the transmitter. Yet it follows that Cg is an achievable rate with 
perfect secrecy even if the eavesdropper observes Vq{A'^Xq + A^), where A'^ and X'^ are unknown but satisfy 
A'^ < Az and X'-, > ^A^. Thus, only one-sided estimates of Az and A^ are needed. In practice, an upper bound on 
Az could be provided by guaranteeing that any potential eavesdropper is more than a certain distance away from 
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the transmitter. A lower bound on the dark current could be provided using ambient noise measurements and 
the known physical limitations of existing receivers. 

Worst case scenario: A particularly insightful case is when ^ = ^ = a. This situation happens when the 
eavesdropper observes a thinned version of the signal of the legitimate receiver, i.e., i^^ = in d?]). In this case, 
after some algebraic manipulations, we obtain that 

a = - a, (12) 

and the secrecy capacity reduces to 

Cs = {Xy - X,) (^^ (^1 + ^ -(l + a)ln(^l + i^^ . (13) 

This is saying that the secrecy capacity is the difference between the capacity of the main channel (the channel 
between the transmitter and the legitimate receiver) and the capacity of the eavesdropper's channel. For instance, in 
the special case when there is no dark current Xy = Xz = 0, we find that a* = ^ and the secrecy capacity reduces 
to 

Cs = (14) 

e 

For a degraded DMC, Wyner 121 showed that the secrecy capacity is equal to maXp^(/(X; Y) — I{X; Z)). Hence 
the following inequality is always satisfied 

Secrecy Capacity > Cm — Cw, 

where Cm is the capacity of the main channel and Cyy is the capacity of the eavesdropper's channel. As shown 
in ll28l . there is equality in the inequality above if there is an input probability distribution px that maximizes 
simultaneously I{X; Y) and I{X; Z). This is exactly what is happening here, when ^ = ^ the mutual information 
I{Xq; Yq") and I{Xq; Zq) are both maximized by letting the input Xq cycle infinitely fast between its extreme 

values, i.e., the peak power 1 and with Pr(Xt = 1) = 1 — Pr{Xt = 0) = a* = ^^^^J^ o". 

Before we proceed further with the presentation of the problem considered in this paper, we give a lemma that 
will prove to be useful in the proofs of the achievability and the converse, a proof of this result can be found for 
instance in |[T6l . 
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Lemma 2. The mutual infonnation between the input X"^ and the output ^^(aX^ + A) can be upper bounded as 
follows^ 

I{Xl, VUaXl + A)) < r{E[^{Xt)] - ^{E[Xt]))dt, (15) 

^0 

where 'd{x) = {ax + A) ln{ax + A). 

III. ACHIEVABILITY OF Cg 

Our achievability proof relies on the structured codes that were designed for the Poisson channel by Wyner 
ifTSl . Before delving into the details of the proof, we will briefly describe the code construction and the properties 
inherited by this code. 

Wyner codes W(T, M, A;): Let T, M and k be given, and construct an M x (^^^) binary matrix X as follows. 
The columns of X are the (^^) binary M-vectors with exactly k ones and M — k zeros. Now partition the interval 
[0,T] into {^) subuitervals of equal length wt — and construct M waveforms {Xj{m)}^^-^ as follows 

Xt{m) = X{m, n),t & ((n — 1)wt, uzut], n = 1, ( ^ ) • (16) 



For oi = jj fixed, these codes satisfy 

^IJi{t : Xt{m) = 1} = a, for all m, (17) 
with /i being the Lebesgue measure. If moreover M = e^^, for T >> 1, Wyner showed that for m ^ m' 

^^x{t : Xt(m) = l,Xt{m') = 0} « a(l - a). (18) 

As such for T large enough the codewords {Xq {m)}^^^ will behave as if they were chosen independently. 
After this brief overview of Wyner codes we are in a position to state the achievability theorem and prove it. 

Theorem 2. Any secrecy rate Rs < Cs is achievable. 

Proof: Let e > be arbitrary and let Rg = Cs — e. Define 

Ru = a*{Au + A„) ln(^„ + A„) + (1 - a*)A« In A^ 

- {A^a* + A„) ln(A„a* + A„), u G {y, z}. (19) 
''Note that some authors use the function i3{x) — {ax + A) ln{ax + A) — A In A instead but the constant term A In A cancels out here. 
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After few algebraic manipulations, we can show that 

Cs = Ry ~ Rz- 

Given these parameters, the encoder-decoder pair considered here works as follows. 

Encoding: Let M = e^"^ and let U be uniformly distributed onU = {1, ...,M}. Define My = e^^""!^)^ and 
following the steps described above construct a codqj C = >V(T, My, a* My). Partition this code arbitrarily into M 
smaller subcodes, i.e., C = ufliCi. The cardinality of each each subcode Ci will be equal to Mz = ^ = e*^^""^)^. 

The encoder works as follows, when the message U = m is chosen, the codeword is selected uniformly 
randomly from Cm- 

Decoding: The decoder considered here is the maximum likelihood decoder constructed by Wyner ifTSl . After 
observing Y^, the decoder at the legitimate receiver computes the following metric 

/ dYt, (20) 

where Sm = {t e [0,T] : Xt{m) = 1}. Then Dt{Y^) = m if m maximizes with ties resolved in favor of 
the smallest m. 

Analysis of Pe- The fact that Pg — > 0, follows simply from the fact that Wyner codes with the peak power 1 
and average power a* are capacity achieving. 
Analysis of i/(C/;Zj): 

Notice first that for each m, the waveform X[[^(?7i) is piecewise constant. It follows that a sufficient statistic for 
making a decision is the number of arrivals during each subinterval ((n — l)ti7r, ntuj^], i.e., Z„ = ZnvjT~ ^(n~i)-coT' 
n = l,...,Ny with Ny = [J'^^^ ). Consequently, 

I(Xo^;Zo^)=/(X;Z) 
I{U; Zl) = I{U; Z) (21) 

where X = [Xi, Xj^^], = or 1 depending on the choice of the codeword and Z = [Zi, Zj^J. The 

equalities above follows from the fact that Z is a sufficient statistic. 

^Note that even if q* is not a rational number, it can be approximated arbitrary close by rationals. 
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As a result of Lemma 2, we have 



]))dt, 



(22) 



where (/)z{x) = {AzX + A^) ln{AzX + A^). Because of the uniform choice in the encoding scheme and in view of 
(fTTl ) we must have that Pr[Xt = 1] = 1 — Pr[Xt = 0] = a*, hence we have 



E[MXt)] = a>,(l) + (1 - a*)MO) 



and 



a* {A, + A^) ln{Az + Xz) + (1 - a*)A^ In A^. 



,(E[Xt]) = Ma*) = i^za* + Xz) HAza* + A, 



(23) 



(24) 



Consequently, we deduce that 



^I{X;Z) = ^I{Xl,Z^)<Rz 



(25) 



Notice that every subcode Cm can be viewed as a code for the eavesdropper's channel with Mz codewords and 
uniform prior distribution. Define 6m to be the probability of error for code Cm (l < m < M) with the (optimal) 
decoder described above and let 5 = Ylim=i ^rn- From Fano's inequality we have 



H{y.\Z,U = m)<H{5m) + 5m\nMz, 



(26) 



where H{p) = —p\n.{p) — (1 — p) ln(l — p) is the binary entropy. 

Since the codewords are uniformly distributed in each subcode, we deduce that H{'K\U = m) = In Mz- We 
conclude therefore that 



/(X; Z\U = m) = F(X|C/ = m) - H(X\Z, U = m) 



>\nMz- {H{5m) + 5m In Mz 



Averaging over U and by using the concavity of H{-) we find that 



I{X;Z\U) > In Mz- iH{5) +5 In Mz 



(27) 



(28) 
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Notice also that ?7 ^ X ^ Z form a Markov chain, i.e., 



^I{U- Z) = X; Z) - i/(X; Z\U) 

= i/(X;Z)-i/(X;ZlC/) 



(29) 



Combined with the last inequaUty this implies that 



i/(C/; Z) < i/(X; Z) - i In M, + + 5 In M,). 



(30) 



Inequalities (l25l) and (l30l) result in the following 



^/(C/;Z) < i?, - ^[InM, - + 51nM,)]. 



(31) 



As Mz = e^^' 2^"^, this gives 



(32) 



By the code construction described above, the codewords of every subcode Cm (1 < m < M) satisfy (ITtI ) and ([T8] ) 
(with a replaced by a*). These two conditions dictate the pairwise error probability of the codewords in Cm ifTSl . 
Since the overall error probability of the code Cm is governed by the pairwise error probability lITSl . it follows that 
every subcode Cm is capacity achieving for the eavesdropper's channel and as such 5m for m = 1, ...,M can be 
made arbitrarily small. Hence, by choosing T large enough, we can enforce that ^H{5) + 6{Rz — §)<§. The 
previous inequality shows therefore that j^I{U; Zq) < e and the desired secrecy condition is satisfied. 

This shows that any secrecy rate Rs < Cs can be achieved and completes the achievability proof. ■ 

IV. The Converse for the secrecy capacity 
Before delving into the details of the converse we need the following technical lemma due to Wyner |f29]| . 

Lemma 3 (Wyner li29J ). If T : O — > F is a random variable such that F is a finite set and Aq = {Aj , <t<T} is 
a given stochastic process, then we have 



I{T;A^) = H{T)-H{T\A^ 



(33) 



where H{r) is the usual entropy for discrete random variables and 




(34) 
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This lemma is standard when all the random variables have discrete alphabets however this extension is needed in 
this paper since we are dealing with continuous time stochastic processes. 

The converse theorem will be proved through a sequence of Lemmas. The first one gives an inequality that must 
satisfied by every encoder-decoder pair {Et, Dt)- 

Lemma 4. For every (M, T) code with rate R = we have 

R < TiT^) {l(^o:yo\Zl) + I{U; Zl) + H{P,)) . (35) 

Proof: Let U = Dx{Y^) denote the output of the decoder at the legitimate receiver, so that Pg = Pr(f7 ^ tj). 
We then have the following sequence of identities 

RT = InM = H{U) = H{U\Y^) + I{U;Y^) 

< H{U\U) + I{U-Y^) 

< H{Pe) + Pe In M + /([/; Yq^) , (36) 

the equality (a) follows from Wyner's lemma and the inequality (c) is an application of Fano's inequality. For the 
inequality (6), since U Yq — > [/ is a Markov chain we deduce thaj^ I{U^Yq) > I{U,U). Now by invoking 
Wyner's lemma again it follows that H{U\Y^) < H(U\U). 

From Kolmogorov's formula (see Lemma 3.2 in ll29l ) we havel^ 

I{U;Y;[ ,Zl) = I{U;Y;^) + I{U-zI\Y;^), (37) 

since U — > Y^ Z^ is a Markov chain we deduce thai^ I{U ; Z'^\Yq) = 0. By applying Kolmogorov's formula 
again we obtain 

/([/; Fo^) = /([/; Fo^, Z^) = I{U; Z^) + /([/; Y;[\ZI) 

< I{U; zl) + l{Xl- Y^\Zl), (38) 

where the last inequality follows from the fact that U — ^ Y^ Zj form a Markov chain. Combining this 

last inequality with (c) and rearranging the terms yields the desired inequality. ■ 

*The data processing inequality extends to arbitrary random variables, see for instance Theorem 3.4 in 1291 . 
^The definition of the conditional mutual information for arbitrary random variables can be found in 1291 . 
^Refer to Lemma 3.1. in 1291. 



s 



16 

Lemma 5. If E \XtlnXt\ < oo, then 

I{Xl- Y^\Zl) = liX^; Y^) - I{Xj; Z^) (39) 

Proof: Applying Kolmogorov's formula twice gives 

I{X^- Y^, Zl) = I{Xl, Zl) + l{Xl- Y^\Zl) 

= I{Xl, Y^) + I{Xl- Zl\Y^). (40) 

Since X^ Y^ — > Zq form a Markov chain, we have I{Xq] Zq\Yq ) = 0. Consequently we deduce that 

I{Xl; Y;^) = I{Xl, Zl) + liXl; Y^\Z'l). (41) 

The condition E \Xt InX^l < oo impUes that I{Xq; Zq) < oo and it follows that /(X^; Y^) - I{Xq] Zq) = 

i{xl,Y^\zl). m 

The goal of the upcoming lemmas is to prove that I{Xl;Y^\Zl) < TCg, where Cg is given by ( fTOt . We first 
decompose I{Xq -jY^lZj) as follows 

IiXl,Y,^\Zl) = I{Xl,Yl) - I{Xl, Z^) 
= I{Xl,Yl)-I{Xl;Y^) 

+ I{xI,Y;^)-I{xI;Z];), (42) 

where Y^ has been defined in (|7]l. The next two lemmas will provide upper bounds on I{XI;Yq) — I{Xq;Yq) 
and I{Xl,Y^)-I{X'l;Zl). 

Lemma 6. If E j'^ \Xt\Q.Xt\ < oo, then 

i{xI-yI)-i{xI,y;[)< 

j^(0,(E[Xt]) - n<pz{Xt)]) - {(t^yinxt]) - n<t^y{Xt)])^ dt, (43) 

where (j)y{x) = (AyX + A^,) ln{AyX + Ay) and (pzix) has been defined above analogously. 
Proof: Note first that HB, 

liX^; Y^) = r inM^t)] - E[<PyiE[Xt\Ti^])]) dt, (44) 

^0 



^ ^ A 
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and 

fT 



I{Xl- Fo^) = ^ {^[x{Xt)] - nx{nXt\T'y])]) dt, (45) 

where x{^) = i^yX + ^^z) \n{AyX + ^A^). Consequently, using the fact that x{x) = ^<pz{x) + ln{^){AyX + 
^Az) and after simpUfications, we deduce the following 

rp 

I{Xl,Y^)-I{Xl,Y^)= [ {E[MXt)]-nMmt\J'i^])])dt 

Jo 

A F / \ 

-^j^ [nMXt)]-nMnxt\j'f])])dt. (46) 

Recall that Y^f = Y^ + H^, where is a homogeneous Poisson process independent of {Xq',Yq'). Clearly, 
J|, C V J^, with V = a{J^ U J^) being the smallest sigma-field containing U J^. From the 
independence of {Xj,Y^) from H^, using the law of redundant conditioning (see, e.g. f30', pp. 281-282]), we 
deduce that 

E[Xt| V Tjj] = E[Xt\Tlr] a.s. (47) 

We can now establish the following sequence of identities 

E[MnXt\T'Y])] EiMnXtlT'y V Tjj])] (48) 

E[E[MnXt\T'y V (49) 

> E[ME[E[Xt\Tl. V T'h]\TI])] (50) 

EiMnXtlJ^'y])], (51) 

where (a) follows from (|47] ). (6) follows from the smoothing property of the conditional expectation, (c) from 
Jensen's inequality applied to the convex function (pzi') and (d) from the fact that C Ty V J")^ and the 
smoothing property. 

We deduce therefore that 



I{X^;Y,^)-I{X^;Y,^)< / {E[cPy{Xt)] - E[^y{E[Xt\T'y])]) dt 

Jo 

A 

- / {E[MXt)] - E[ME[Xt\T^])]) dt. (52) 

Jo 
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A simple derivation shows that the function Tr{x) = </>y(x) — ^(p^ix) is convex as 

^ ^ (AyX + Xy){A^X + \,) - 

Now invoking again Jensen's inequality we obtain that 

E[7r(E[Xt|7t'])] > vr(E[E[Xt|J-f]]) = 7r(E[Xt]). (54) 

Using this last inequality and after rearranging the terms we obtain the desired result, i.e., 

liXl, Y^) - I{Xl; Y^) < r {E[^yiXt)] - Mn^t])) dt 

Jo 

r inM^t)] - Mnxt])) dt. (55) 

^2 Jo 



An alternative proof of this lemma using the link provided in [24] between the MMSE and the mutual information 
in Poisson channels is given in Appendix A. 

Lemma 7. JfE \XtlnXt\ < oo, then 

I{Xl,Y,'^)-I{Xl,Z^) 

< (4^ - 1) rinMXt)] - Mnxt]))dt (56) 

Jo 

Proof: Recall that Zq was obtained from Y^ by thinning with erasure probability 1 — Let the process 
Zq denote those points that were erased, hence we have that Zj is a doubly stochastic Poisson process with 
instantaneous rate {Ay — Az)Xt + — 1)A2. Moreover Z^ and Z]^ are independent given X^. We proceed with 
the proof of the lemma by showing that the following inequality holds 

I{Xl, Y^) - I{Xl, Zl) < I{Xl, Zj). (57) 

Indeed, notice first that Xq {Z^, Z^ ) Y^ is a Markov chain, hence from the data processing inequality we 
deduce that 

I(Xo^; Yo^) = I{Xl, Zl + ~Zl) < liXl; Z^, Z^). (58) 

Consider now two partitions of $7, Qz = C /"J and = {Bj}^^-^ C /"T. Define two discrete random 

variables D and l) on as follows D{uj) = i if u £ Ai and D{uj) = j if uj G Bj. The mutual information 
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I{Xj; Z^, Zl) can be computed as l|29l 

I(Xl-Zl,Zl)= sup l{Xl-B,B), (59) 

Q.z,Qz 

where the supremum is taken over all such partitions of Vl. We proceed to prove (|57l ) as follows 

I{Xl- D, D) H{D, D) - H{D,D\Xl) 

< H{D) + H{D) - H{D,D\Xl) 

H{D) + H{D) - H{D\Xl) - H{b\Xl) 

I{D;Xl)+I{D-X^), (60) 

where (a) follows from Lemma 3 (Wyner's lemma) applied to the random variable {D,D), (d) is a also a direct 
instance of this lemma. The inequality (6) is the independence bound on the entropy (which holds here since the 
random variables D and D are discrete). The equality (c) results from the fact that D and D are conditionally 
independent given X^, indeed D € /"J whereas D S /"J and and J^J are conditionally independent given 
J"^. Consequently we have 

I{X^;Zj,Zj)= sup I{Xj;D,D) 

Qz,Qz 

< sup {I{Xl-,D) + I{X];-b)) 

Qz,Qz 

= I{Xl,Zl) + I{Xl,Z];). (61) 
Combining the last inequality with dSSl ) we deduce thaj^ 

I{Xl, Y^) - I{Xl; Zl) < I{Xl; Zl). (62) 

Now using Lemma 2 we have 

l{Xl- ~Zl) < r{E[^iXt)] - ip{E[Xt]))dt, (63) 

where 

(^(x) = {{Ay - A,)x + {^- H{Ay - A-^x + - 1)A,). (64) 

Note that since E/jf \Xt \nXt\< oo, then I{Xj; Zq) < oo and hence the inequality is well defined. 
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Notice now that 

^{x) = {^- 1)M^) + (4^ - 1) ln(4^ - l){A^x + A,). (65) 

^z 

Plugging this identity in the inequality above, the linear term in x disappears and we are left with the inequality 
presented in the lemma. ■ 
An alternative proof of this lemma using the link provided in [24] between the MMSE and the mutual information 
in Poisson channels is given in Appendix B. 

Theorem 3. IfE \XtlnXt\ < oo, then 

^I{X^;Y,^\Z^)<Cs (66) 
Proof: Combining (l42l) and the result of the two previous lemmas yields 

I{X^; Yif) - I{Xl- Zl) < f{nK{Xt)] - K{nXt]))dt, (67) 
where K{x) = 4>y{x) — (t)z{x). A straightforward computation shows that 

{AyX + Xy){A^X + X^) 

since Ay > Az and XzAy > XyAyAz > XyA^ we deduce that K"{x) > 0. Moreover due to the assumption that 
at least one of the inequalities (l5]l or (|6ll is strict, we conclude that K"{x) > (for x > 0) and K{-) is strictly 
convex. 



Notice now that we have 



1 
T 



T 

{nK[Xt)\ - K{nXt]))dt 







< max max / K{x)p{dx) — K{a) 

0<a<l \p:r^xp(dx)=aJo 



0<a<l \p:f^xp{dx)=aJo 

= max {aK{l) + (1 - a)K{0) - K{a)) , (69) 

0<Q!<1 

where (a) follows from fixing K[Xt\ = a and maximizing over all distributions p{x) on [0, 1] with mean a. Equality 
{b) follows from the convexity of K{-) (refer to lfT6l and EH), i-C-, the maximizing distribution p puts all the mass 
on the extremes {0, 1} and since the mean is a, the maximizing p assigns the mass a to 1 and 1 — a to 0. 
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The maximization of the last term shows that the optimal a* is the unique solution to the equation 

K'{a*) = K{l)-K{Q), 

which, after some algebraic manipulations, gives that a* is the solution to (ITTI ). The existence of a* follows simply 
from the mean value theorem, whereas the uniqueness is a consequence of the strict monotonicity of K'{x). 
Consequently, the following is true 

^ j\m{Xt)]- K{nxt]))dt 

< a*K{l) + (1 - a*)K{0) - K{a*) 



«-(A,-^.) + ,„y- +l„^|^^J^^). (70) 



This fact when combined with (1671) gives the result announced in the theorem. ■ 
We are now in a position to prove the converse theorem. 

Theorem 4 (Converse). IfRs is an achievable secrecy rate then Rs < Cs- 

Proof: Since the secrecy rate Rs is achievable then for all < e < ^ and sufficiently large T, there exists an 

(M, T) code such that ^-^M. > R^ - e, Pe < e and ^/(C/; Zj) < e. Hence we have 

hiM 
Rs<-^ + e 

- l-Pe\ T T J 

W 1 / H(e)\ 

<- C, + e + ^ +e, (71) 



1-e 

where inequality (a) follows from Lemma 4, inequality (6) from Theorem 3 and inequality (c) from the properties 
of the code. Now since e is arbitrary, letting e ^ yields Rs < Cg- ■ 



V. Rate-Equivocation region 

In this section we turn our attention to the rate equivocation region of the degraded Poisson wiretap channel. The 
level of ignorance of the eavesdropper about the transmitted message U will be measured here by the normalized 
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equivocation given by 



_ H{U\Zl) 



Definition A rate-equivocation pair {R, d) is said to be achievable for the Poisson wiretap channel if for all e > 
and all sufficiently large T, there exists an (M, T) code such that 

InM „ 

T - 

Pe < e 

The following theorem gives the rate equivocation region for the degraded Poisson Wiretap channel. 

Theorem 5. The rate-equivocation region is the set of all rate-equivocation pairs {R, d) for which there exits some 
a G [0, 1] such that 

< all! ((Aj, + XyY-^^y'^ + (1 - a) In (a^«) - In {{AyOt + Ay)^''"+^'') (75) 
d < 1 (76) 

To ease the notations, using the functions K{-) and (/'y(-), we can rewrite the two first inequalities as Rd < 
aK{l) + (1 - a)K{0) - K{a) and R < a<j)y{l) + (1 - a)0y(O) - (pyia). 

Proof: The main ingredients needed to prove this theorem has been already used to obtain the secrecy capacity. 
More specifically, for the achievability proof we will use stochastic encoding combined with Wyner codes for the 
Poisson channel, and for the converse we will use the key inequality (l67l ) established by Lemma 6 and 7. 

A. Direct result 

Note first that for a fixed rate R, if the rate equivocation pair {R, d) is achievable then the pair {R, d) is achievable 
for all < d < d. Hence, in order to establish the direct result, it is enough to prove that any rate-equivocation 
pair {R, d) satisfying Rd = aK{l) + (1 - a)i^(O) - K{a), R < acjjyil) + (1 - a)(l}y{<d) - (l)y{a) and d < 1 for 
some a G [0, 1] is achievable. 
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Define 

Ru = a0«(l) + (1 - a)0„(O) - (/.„(a), u G {y, z}. (77) 

Let e > be arbitrary (small enough) and let R = with d < 1 and i? < a(Ay(l) + (1 - a)<Ay(0) - (l^yia). 

The message [/ to be transmitted is selected uniformly randomly from lA = {1,...,M} with M = e^^. Define 
My = e^^""^'^?)^ and, following the steps described for the achievability of the secrecy capacity, construct the 
Wyner code C = W(T, My,aMy). Partition this code arbitrarily into M smaller subcodes, i.e., C = ufl-^^Ct. The 
cardinality of each subcode Cj will be equal to Mz = ^ = e^^^~^~^'^~^'^ . Notice that with this choice of 
parameters we have 



^\nMz = Ry - R - 3e^ < Ry - Rd - 3e| = R^ - (78) 



The probability of error of the legitimate receiver can be made less than e because the Wyner code C can achieve 
the rate Ry. 

The equivocation of the code C can be lower bounded using the same steps used to established the upper bound 
on I{U; Zq) for the secrecy capacity, as follows 

H{U) RT ^ ' 

> 1 - ^ + ;^lnM, - + <51nM,) (80) 

Ry Ry — R — 36-§ 1 , ^ „x 
= 1 - ^ + ^^-^ - +5lnMz) (81) 

€ 1 , , , R, € , 

In the above, inequality (a) follows from (|3T]) and ^ = jj X]m=i where 5m is the probability of error for the 
code Cm (1 < m < M) with the (optimal) decoder described previously. 

As was discussed before, the term -^H{6) + — |) can be made less than | for T large enough, which 
means that 

This establishes that the rate-equivocation pair {R, d) is achievable. 
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B. Converse 

For every (Af, T) code with rate Rt = and equivocation At = ^^^^^^ ^ we have 

TRtAt = H{U\Z^) = H{U) - I{U; Z^) 

= H{U\Y;[) + /([/; Fo^) - /([/; Z^f) 
<if(C/|?7) + /(C/;yo^|Zo^) 

< /7(Pe) + Pe In M + I{X^- Y^\Z^). (84) 
From Lemma 6 and 7 (cf. (l67l) ) we have that 

I{Xl- Y;[\Z^) < riE[KiXt)] - KiE[Xt]))dt. (85) 

^0 



Consequently, we deduce that 

rT 
10 



r^At < + ^ j\^K{Xt)] - K{¥.[Xt]))dt (86) 

< + aK{l) + (1 - a)K(O) - K(a), (87) 



with a = y /Q^E[Xt](it and the last inequality follows from the convexity of the function K{-). Note that since 
< < 1 it follows that < a < 1. 
Similarly, we have that 

Rt = '^ = ^H{U\Y,^) + ^I{U-Y,^) 
<^H{U\U) + ^I{Xl,Y^) 

< ^iH{Pe) + P.lnM) + i j\n<t>y{Xt)] - <t>y{nXt]))dt 
W H(P ) ^ P In M 

< ^ Z^' + ac/>,(l) + (1 - a)^y{<d) - ^y{a), (88) 

where (a) follows from Fano's inequality and Lemma 2 and (b) follows from the convexity of the function 4>y{-)- 
Assume now that {R, d) is achievable, then for all < e < | and all sufficiently large T, there exists an (M, T) 
code such that Rx > R — e, < e and Aj- > d — e. By definition Aj- < 1, and hence d < 1 + e and in light of 
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the previous inequalities we have 



{R-e){d-e) < 



{R-e)< 



H{e) + elnM 
f 

H{e) + elnM 
f 



+ a<py{l) + (1 - a)(py{0) - (py{a). 



+ aK{l) + (1 - a)K{0) - K{a) 



(89) 



(90) 



Now since e is arbitrary, letting e — )• yields the desired result. 



VI. Conclusion and discussion 



Motivated by the practical advantages of optical communication over RF for secure communication, we have 
derived the secrecy capacity and characterized the rate-equivocation region of the degraded Poisson wiretap channel. 

Several interesting problems remain open and deserve further investigation. One is the non-degraded Poisson 
Wiretap channel. One can imagine a situation in which the eavesdropper is equipped with a powerful detector 
characterized by a negligible dark current (i.e., = 0). If the detector of the legitimate receiver has a higher 
received power from the transmitter but is more prone to dark current, then the channel will not be degraded. This 
is a practically-important situation but is not covered by the results of this paper. 

Another issue that we have not considered is fading. Indeed, for wireless optical communications, atmospheric 
turbulence can induce random fluctuations of the intensity of the transmitted Light beam ||23l . which creates fading 
and complicates secure communication. Note, however, that this fading is fundamentally different from multipath 
fading and is more manageable from the standpoint of achieving secure communication. 

MIMO Poisson channels have received some interest lately (see ll32ll and the references therein), and as has been 
done in the Gaussian setting, it would be interesting to see the impact of having multiple antennas on the secrecy 
capacity in the Poisson regime. 

We believe that the results derived in this paper and the tools used to derive them could be used to address these 
problems. 
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Appendix A 
An MMSE Proof for Lemma 6 

In this first appendix, we provide an alternative proof for Lemma 6. Tliis proof uses the link established in ||24]| 
between the MMSE and the mutual information in Poisson channels. Note first that since E {XtlnXtl < oo, 
we have that I{Xl;VQ{AyX^ + A)) is differentiable and Theorem 3 in 1241 states that 

d 



^^liXlXi^yXo +X)) 
rT 



[ E{ln{AyXt + X)-ln{AyXt + X)T}dt. (91) 

^0 



Notice now that 



IiXl,Y,^)-I{X^-Y,^) 
r-A„+A 



/ " ^HXj; VliAyXl + \))d\. (92) 



Therefore 



I{XI,Y;^)-I{XI,Y^) = 

{¥.{\n{AyXt + \)T]-HMAyXt + \)])dtj d\. (93) 
Since the function In(-) is concave, using Jensen's inequality and the iterative conditioning property we have 

¥.{\n{AyXt + A)t} < \nn{AyXt + \)t] = HAyK[Xt] + A). 
Making use of this inequaUty and the fact that Ay + A = ^ A^ we deduce that 

i{xi,y;^)-i[xi,y;[)< 



\x\^{Ay^Xt\ + A)dA-E{ / \x^^{AyXt + X)d\} dt, (94) 

where we have also invoked Fubini's theorem to make the necessary exchanges between the integrals and the 
expectation operator. The desired inequality is then obtained after some algebraic manipulations using the elementary 
identity 

/mA,. + AMA = (A.. + A)1„(A„. + A)-A. (95) 
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Appendix B 
An MMSE Proof for Lemma 7 

Here we provide an alternative proof for Lemma 7. For ease of notations define Wt = AyXt + ^^z- Using 
Theorem 4 in [24] we obtain that 

^/(l^o^; V^iaW;[)) = ^ E[Wt ln{aWt)]dt - ^ E[E[Wt\V^ (aWif)] ln{E[aWt\V^ {aW^)])]dt 

= r nWtlnWt]dt - rE[E[Wt\V^iaW;[)]lniE[Wt\VjiaW;[mdt, (96) 
Jo Jo 

where the second equality is obtained after some simplifications using the identity E[E[Wt\Vj (aW^)]] = E[Wt]. 
Now by the convexity of the function C(x) = xln(x), Jensen's inequality gives 

E[CiE[Wt\Vj{aW;[m > C{E[E[Wt\vUaW;f)]]) 

= C{E[Wt\). (97) 

It follows therefore that 

^I(W^;Vj{aW^))< [ E[WtlnWt]dt- [ E[Wt]\nE[Wt]dt. (98) 
da Jq Jo 

Clearly we have that I{W^;V^{aW^)) = I{X^;V^{aW^)). Also we have that = Vj{W^) and = 

Voi^W^). Consequently 

I{X^; Y^) - I{Xl, Z^) = |2 ^^«; ^(f (99) 

Ay 

Using the previous inequality, we conclude that 

I{Xj;Y;f)-IiX^;Zj) < (^j^ E[Wt In Wt]dt - E[Wt]lnE[Wt]d?j da 

Ay 

- r{AyE[Xt] + ^X,)\n{AyE[Xt] + ^Xz)dty (100) 

JO ^ ^ / 

After some simplifications, the last inequality gives the desired result, i.e., 

/(X^f; Yo^) - liXl, Zl) < - 1) mAXt)\ - 4>Amt\)) dt. (101) 
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